Neighboring Digits Pattern Training Method in Quickly-spoken Connected Mandarin Digits Speech Recognition

نویسندگان

  • Chunyi Guo
  • Runzhi Li
  • Ming Fan
  • Kejun Liu
چکیده

Deletion errors are most usually occurred in connected Mandarin digit string speech recognition when speaking rate is fast, and are the main reasons leading to the increasing of the recognition error rate and the decline of the recognition accuracy. In this paper, a new training method named neighboring digits pattern is given based on sufficient statistics of recognition errors of the traditional system in order to eliminate most of deletion errors which seriously affect the system recognition rate. The training process is presented and the performance evaluation is given. The result analysis demonstrates that the new method can reduce the deletion errors effectively and improve the system recognition rate from 96.4% to 98.3%.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Efficient Method for Removing Deletion Errors in Quickly-spoken Connected Mandarin Digit String Speech Recognition

Connected Mandarin digit string speech, especially at rapid spoken rate, is very difficult to recognize correctly. In this paper, a new training method named neighboring digits pattern is proposed in order to eliminate most of deletion errors which frequently occur in Mandarin digits speech recognition at high speaking rate when we have enough quickly-spoken speech data as the training set. The...

متن کامل

Improvement in Connected Mandarin Digit Recognition by Explicitly Modeling Coarticulatory Information

The most successful training scheme for recognition of connected spoken digits is the segmental k-means algorithm, which implicitly captures the coarticulatory information of connected speech iteratively to establish reliable reference patterns. However, when this algorithm is applied to Mandarin digits, the obtained performance is inferior to that of English. Hence, a novel approach is propose...

متن کامل

Mandarin connected digits recognition for whispered speech

In this paper, the acoustic characteristics and recognition of whispered speech are discussed. A Mandarin digits database is built both in normal speech and whispered speech. The collected speech materials of normal and whispered speech are analyzed to verify the characteristics and differences for the two kinds of speech. Cross recognition is carried out using normal and whispered speech as tr...

متن کامل

Recognizing connected digits in a natural spoken dialog

This paper addresses the general problem of connected digit recognition in the telecommunication environment. In particular, we focus on a task of recognizing digits when embedded in a natural spoken dialog. Two di erent design strategies are investigated: keyword detection or word spotting, and large-vocabulary continuous speech recognition. We will characterize the potential bene ts and descr...

متن کامل

Speech Recognition System of Arabic Digits based on A Telephony Arabic Corpus

Automatic recognition of spoken digits is one of the difficult tasks in the field of computer speech recognition. Spoken digits recognition process is required in many applications such as speech based telephone dialing, airline reservation, automatic directory to retrieve or send information, etc. These applications take numbers and alphabets as input. Arabic language is a Semitic language tha...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of Multimedia

دوره 6  شماره 

صفحات  -

تاریخ انتشار 2011